DUALITY OF ORTHOGONAL 
POLYNOMIALS ON A FINITE SET 

Alexei Borodin 

Abstract. We prove a certain duality relation for orthogonal polynomials defined on 
a finite set. The result is used in a direct proof of the equivalence of two different ways 
of computing the correlation functions of a discrete orthogonal polynomial ensemble. 

Introduction 

This note is about a certain duality of orthogonal polynomials defined on a finite 
set. If the weights of two systems of orthogonal polynomials are related in a certain 
way, then the values of the nth polynomial of the first system at the points of the set 
equal, up to a simple factor, the corresponding values of the (M − n)th polynomial 
of the second system, where M + 1 is the cardinality of the underlying finite set. 

We formulate the exact result and prove it in §1. 

In §2 we explain the motivation which led to the result. We compare two different 
ways to compute probabilistic quantities called correlation functions in a certain 
model. The model is a discrete analog of the orthogonal polynomial ensembles which 
first appeared in random matrix theory, see, e.g., [Dy], [Ga], [GM], 
[Me], [NW]. Discrete orthogonal polynomial ensembles were discussed in [BO1], 
[BO2], [BO3], [J1]-[J3]. 

The results of the two computations must be equal, but this is not at all obvious 
from the explicit formulas. Our duality relation provides a proof of the equivalence 
of the two resulting expressions. 

In §3 we consider two examples where the orthogonal polynomials are classical 
(Krawtchouk and Hahn polynomials). In these cases the duality provides relations 
between similar polynomials with different sets of parameters. The relations are 
also easily verified using known explicit formulas for the polynomials. 

I am very grateful to Grigori Olshanski for numerous discussions. I also want 
to thank Tom Koornwinder for providing me with his computation regarding the 
Hahn polynomials, see §3. 

1. Duality 

Theorem 1. Let 

\[ X = \{x_0, x_1, \dots, x_M\} \subset \mathbb{R} \]

be a finite set of distinct points on the real line, u(x) and v(x) be two positive 
functions on X such that 

\[ u(x_k)\,v(x_k) = \frac{1}{\prod_{i \ne k} (x_k - x_i)^2}, \qquad k = 0, 1, \dots, M, \tag{1} \]

and $P_0, P_1, \dots, P_M$ and $Q_0, Q_1, \dots, Q_M$ be the systems of orthogonal polynomials 
on X with respect to the weights u(x) and v(x), respectively, 

\[ \deg P_i = \deg Q_i = i, \]

\[ \sum_{k=0}^{M} P_i(x_k) P_j(x_k) u(x_k) = \delta_{ij}\, p_i, \qquad \sum_{k=0}^{M} Q_i(x_k) Q_j(x_k) v(x_k) = \delta_{ij}\, q_i, \]

\[ P_i = a_i x^i + \text{lower terms}, \qquad Q_i = b_i x^i + \text{lower terms}. \]

Assume that the polynomials are normalized so that $b_i = p_{M-i}/a_{M-i}$ for all 
$i = 0, 1, \dots, M$. Then 

\[ P_i(x)\sqrt{u(x)} = \epsilon(x)\, Q_{M-i}(x)\sqrt{v(x)}, \qquad x \in X, \]

\[ a_i b_{M-i} = p_i = q_{M-i}, \qquad i = 0, 1, \dots, M, \]

where 

\[ \epsilon(x_k) = \operatorname{sgn} \prod_{i \ne k} (x_k - x_i), \qquad k = 0, 1, \dots, M. \]

Proof. Let us start with one system of polynomials, say, {Pi}, and define a sequence 
of functions {Qi} on X by the equalities 



\[ Q_i(x_k) = \epsilon(x_k)\, P_{M-i}(x_k) \sqrt{\frac{u(x_k)}{v(x_k)}} = \prod_{j \ne k} (x_k - x_j) \cdot P_{M-i}(x_k)\, u(x_k). \]

Then 

\[ \sum_{k=0}^{M} Q_i(x_k) Q_j(x_k) v(x_k) = \sum_{k=0}^{M} P_{M-i}(x_k) P_{M-j}(x_k) u(x_k) = \delta_{ij}\, p_{M-i}, \]

so the functions $\{Q_i\}_{i=0}^{M}$ are pairwise orthogonal with respect to the weight v(x), 
and $q_i = \|Q_i\|_v^2 = p_{M-i}$. 

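Consider the interpolation polynomial $\widetilde{Q}_i(x)$ of degree at most M such that 
$\widetilde{Q}_i(x) = Q_i(x)$ for all $x \in X$. We have (the hat means that the corresponding 
factor is omitted) 

\[ \widetilde{Q}_i(x) = \sum_{m=0}^{M} Q_i(x_m)\, \frac{(x - x_0) \cdots \widehat{(x - x_m)} \cdots (x - x_M)}{(x_m - x_0) \cdots \widehat{(x_m - x_m)} \cdots (x_m - x_M)} \]

\[ = \sum_{m=0}^{M} P_{M-i}(x_m)\, u(x_m)\, (x - x_0) \cdots \widehat{(x - x_m)} \cdots (x - x_M). \]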

The coefficient of $x^n$ of such a polynomial equals 

\[ (-1)^{M-n} \sum_{m=0}^{M} P_{M-i}(x_m)\, u(x_m)\, e_{M-n}(x_0, \dots, \widehat{x_m}, \dots, x_M), \]

where $e_s$ are the elementary symmetric functions: 

\[ e_s(y_0, y_1, \dots) = \sum_{0 \le i_1 < i_2 < \dots < i_s} y_{i_1} y_{i_2} \cdots y_{i_s}. \]

Denote $e_s(x_0, \dots, x_M)$ by $E_s$. Note that $E_0 = 1$ by definition. An application of 
the inclusion-exclusion principle shows that 

\[ e_s(x_0, \dots, \widehat{x_m}, \dots, x_M) = E_s - x_m E_{s-1} + x_m^2 E_{s-2} - \dots + (-1)^s x_m^s. \]
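
The identity for the elementary symmetric functions with one variable removed can be sanity-checked on concrete integer points. The following Python sketch is only an illustration; the helper names `elem_sym` and `check_deleted_variable_identity` are hypothetical and do not come from the paper.

```python
from itertools import combinations
from math import prod

def elem_sym(ys, s):
    """Elementary symmetric function e_s(ys); e_0 = 1 (empty product)."""
    return sum(prod(c) for c in combinations(ys, s))

def check_deleted_variable_identity(xs):
    """Check e_s(xs without x_m) == sum_j (-1)^j x_m^j E_{s-j} for all m and s."""
    for m, xm in enumerate(xs):
        rest = xs[:m] + xs[m + 1:]          # the points with x_m omitted
        for s in range(len(xs)):            # s = 0, ..., M
            lhs = elem_sym(rest, s)
            rhs = sum((-1) ** j * xm ** j * elem_sym(xs, s - j)
                      for j in range(s + 1))
            if lhs != rhs:
                return False
    return True

print(check_deleted_variable_identity([2, -1, 3, 5]))
```

Integer inputs keep the comparison exact, so no floating-point tolerance is needed.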

Then the coefficient of $x^n$ in $\widetilde{Q}_i(x)$ equals 

\[ (-1)^{M-n} \sum_{m=0}^{M} P_{M-i}(x_m)\, u(x_m) \bigl( E_{M-n} - x_m E_{M-n-1} + \dots + (-1)^{M-n} x_m^{M-n} \bigr) \]

\[ = (-1)^{M-n} E_{M-n}\, (P_{M-i}, 1)_u + (-1)^{M-n-1} E_{M-n-1}\, (P_{M-i}, x)_u + \dots + (P_{M-i}, x^{M-n})_u, \]

where $(f, g)_u = \sum_{k=0}^{M} f(x_k) g(x_k) u(x_k)$. But the orthogonality of the $P_j$'s 
implies that $(P_{M-i}, x^r)_u = 0$ for $r < M - i$, and 

\[ (P_{M-i}, x^{M-i})_u = \frac{\|P_{M-i}\|_u^2}{a_{M-i}} = \frac{p_{M-i}}{a_{M-i}}. \]

This immediately implies that $\widetilde{Q}_i$ is a polynomial of degree i with the leading 
coefficient $b_i = p_{M-i}/a_{M-i}$. □ 
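
Theorem 1 can be tested in exact rational arithmetic on a small point set. The sketch below is an illustration under stated assumptions, not the author's method: it takes the $P_i$ monic (so $a_i = 1$), builds them by Gram-Schmidt on the values of $1, x, x^2, \dots$ over X, and checks the square-root-free form $Q_{M-i}(x_k) = \prod_{j \ne k}(x_k - x_j)\, P_i(x_k)\, u(x_k)$ used in the proof. All function names are hypothetical.

```python
from fractions import Fraction
from math import prod

def monic_orthogonal_values(xs, w):
    """Gram-Schmidt on 1, x, x^2, ...: values on xs of the monic orthogonal
    polynomials for the weight w, together with their squared norms."""
    polys, norms = [], []
    for i in range(len(xs)):
        vals = [x ** i for x in xs]
        for q, h in zip(polys, norms):
            c = sum(a * b * wk for a, b, wk in zip(vals, q, w)) / h
            vals = [a - c * b for a, b in zip(vals, q)]
        polys.append(vals)
        norms.append(sum(a * a * wk for a, wk in zip(vals, w)))
    return polys, norms

def vandermonde_factor(xs, k):
    """prod_{j != k} (x_k - x_j)."""
    return prod(xs[k] - xj for j, xj in enumerate(xs) if j != k)

def dual_weight(xs, u):
    """Weight v determined from u by relation (1)."""
    return [1 / (u[k] * vandermonde_factor(xs, k) ** 2) for k in range(len(xs))]

def check_duality(xs, u):
    xs = [Fraction(x) for x in xs]
    u = [Fraction(t) for t in u]
    M = len(xs) - 1
    v = dual_weight(xs, u)
    P, p = monic_orthogonal_values(xs, u)     # assumption: a_i = 1
    Q1, q1 = monic_orthogonal_values(xs, v)   # monic version of Q_i
    # Normalization of Theorem 1: b_i = p_{M-i} / a_{M-i} = p_{M-i}.
    Q = [[p[M - i] * val for val in Q1[i]] for i in range(M + 1)]
    q = [p[M - i] ** 2 * q1[i] for i in range(M + 1)]
    for i in range(M + 1):
        if q[M - i] != p[i]:
            return False
        for k in range(M + 1):
            if Q[M - i][k] != vandermonde_factor(xs, k) * P[i][k] * u[k]:
                return False
    return True

print(check_duality([0, 1, 3, Fraction(7, 2)], [1, 2, Fraction(1, 3), 5]))
```

Working with values on X rather than coefficient lists is enough here, since both sides of the duality relation are only compared at the points of X.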

2. Probabilistic interpretation 

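Recall that $X = \{x_0, \dots, x_M\}$ is a finite subset of the real line. 

For any $m = 1, \dots, M$, denote by $X^{(m)}$ the set of all subsets of X with m points: 

\[ X^{(m)} = \bigl\{ \{x_{i_1}, \dots, x_{i_m}\} \bigm| 0 \le i_1 < \dots < i_m \le M \bigr\}. \]

For any positive function w(x) on X denote by $\mathcal{P}_w^{(m)}$ the probability measure on 
$X^{(m)}$ defined by the formula: 

\[ \mathcal{P}_w^{(m)}\{x_{i_1}, \dots, x_{i_m}\} = \mathrm{const} \prod_{1 \le k < l \le m} (x_{i_k} - x_{i_l})^2 \cdot \prod_{k=1}^{m} w(x_{i_k}). \]

Also denote by $\overline{\mathcal{P}}_w^{(m)}$ the probability measure on $X^{(M-m+1)}$ defined by the relation: 

\[ \overline{\mathcal{P}}_w^{(m)}(A) = \mathcal{P}_w^{(m)}(X \setminus A), \qquad A \in X^{(M-m+1)}. \]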

The next claim was essentially proved in [BO3]. 

Proposition 2. Let u(x) and v(x) be two positive functions on X satisfying (1). 
Then $\overline{\mathcal{P}}_u^{(m)} = \mathcal{P}_v^{(M-m+1)}$ for any $m = 1, \dots, M$. 

Proof. For arbitrary finite sets B and C we will abbreviate 

\[ \Pi(B) = \pm \prod_{\{x, y\} \subset B,\ x \ne y} (x - y), \qquad \Pi(B, C) = \prod_{x \in B,\ y \in C} (x - y). \]

The sign of $\Pi(B)$ is inessential. 

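Take $A = \{x_{i_1}, \dots, x_{i_m}\} \in X^{(m)}$. We have 

\[ \mathcal{P}_u^{(m)}(A) = \mathrm{const} \prod_{1 \le k < l \le m} (x_{i_k} - x_{i_l})^2 \cdot \prod_{k=1}^{m} u(x_{i_k}) = \mathrm{const} \cdot \Pi^2(A) \cdot \prod_{x \in A} u(x). \]

Further, 

\[ \Pi(A) = \pm\, \Pi(X \setminus A) \cdot \frac{\Pi^2(A)\, \Pi(A, X \setminus A)}{\Pi(A)\, \Pi(X \setminus A)\, \Pi(A, X \setminus A)}. \]

But $\Pi(A)\, \Pi(X \setminus A)\, \Pi(A, X \setminus A) = \Pi(X) = \mathrm{const}$, and 

\[ \Pi^2(A)\, \Pi(A, X \setminus A) = \pm \prod_{x \in A}\ \prod_{y \in X,\ y \ne x} (x - y). \]

Hence, using (1), we get 

\[ \Pi^2(A) \cdot \prod_{x \in A} u(x) = \mathrm{const} \cdot \Pi^2(X \setminus A) \Bigl( \prod_{x \in A} v(x) \Bigr)^{-1} \]

\[ = \mathrm{const} \cdot \Pi^2(X \setminus A) \cdot \frac{\prod_{x \in X \setminus A} v(x)}{\prod_{x \in X} v(x)} = \mathrm{const}' \cdot \Pi^2(X \setminus A) \cdot \prod_{x \in X \setminus A} v(x), \]

where $\mathrm{const}' = \mathrm{const} \cdot \bigl( \prod_{x \in X} v(x) \bigr)^{-1}$. Thus, $\overline{\mathcal{P}}_u^{(m)}$ and $\mathcal{P}_v^{(M-m+1)}$ differ by a 
multiplicative constant. Since both $\overline{\mathcal{P}}_u^{(m)}$ and $\mathcal{P}_v^{(M-m+1)}$ are probability measures, 
they must coincide. □ 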
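
Proposition 2 can be confirmed by brute force on a small set X. The following Python sketch (hypothetical helper names, exact rational arithmetic) enumerates all m-point subsets, normalizes the weights to obtain $\mathcal{P}_u^{(m)}$, and compares the probability of each set A with the probability of its complement under $\mathcal{P}_v^{(M-m+1)}$.

```python
from fractions import Fraction
from itertools import combinations
from math import prod

def ensemble(xs, w, m):
    """P_w^{(m)} as a dict: sorted m-point index tuple -> probability."""
    raw = {}
    for idx in combinations(range(len(xs)), m):
        vdm2 = prod((xs[a] - xs[b]) ** 2 for a, b in combinations(idx, 2))
        raw[idx] = Fraction(vdm2) * prod(w[i] for i in idx)
    total = sum(raw.values())
    return {idx: val / total for idx, val in raw.items()}

def dual_weight(xs, u):
    """Weight v determined from u by relation (1)."""
    return [Fraction(1) / (u[k] * prod((xs[k] - xj) ** 2
                                       for j, xj in enumerate(xs) if j != k))
            for k in range(len(xs))]

def check_complement_duality(xs, u, m):
    """Check P_u^{(m)}(A) == P_v^{(M-m+1)}(X minus A) for every m-point set A."""
    n = len(xs)                       # n = M + 1
    v = dual_weight(xs, u)
    P_u = ensemble(xs, u, m)
    P_v = ensemble(xs, v, n - m)      # n - m = M - m + 1
    for idx, prob in P_u.items():
        complement = tuple(i for i in range(n) if i not in idx)
        if P_v[complement] != prob:
            return False
    return True

xs = [0, 1, 3, 4, 6]
u = [1, 2, Fraction(1, 2), 3, 1]
print(all(check_complement_duality(xs, u, m) for m in range(1, 5)))
```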

Let $\mu$ be an arbitrary probability measure on the set of all subsets of X. Note 
that any probability measure on $X^{(m)}$ can be trivially extended to a measure on 
the set of all subsets of X. 

For any $n = 1, 2, \dots, M$, we define the nth correlation function of $\mu$ 

\[ \rho_n(\,\cdot \mid \mu) : X^{(n)} \to \mathbb{R}_{\ge 0} \]

by the formula 

\[ \rho_n(A \mid \mu) = \sum_{B \supset A} \mu(B). \]

In other words, $\rho_n(A \mid \mu)$ is the probability (with respect to $\mu$) that the random set 
B contains a fixed set $A \in X^{(n)}$. 

Below we use the notation of Theorem 1 for the orthogonal polynomials associated 
with the weights u(x) and v(x). 

Proposition 3. For any $m = 1, \dots, M$, the correlation functions of $\mathcal{P}_u^{(m)}$ have the 
form 

\[ \rho_n(\{x_{i_1}, \dots, x_{i_n}\} \mid \mathcal{P}_u^{(m)}) = \det \bigl[ K_u^{(m)}(x_{i_k}, x_{i_l}) \bigr]_{k,l = 1, \dots, n}, \]

where 

\[ K_u^{(m)}(x, y) = \sqrt{u(x) u(y)} \sum_{i=0}^{m-1} \frac{P_i(x) P_i(y)}{p_i}. \]

Proof. A standard argument from random matrix theory, see, e.g., [Dy], [Me, 
5.2]. □ 
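
The determinantal formula can be compared with a brute-force evaluation of the correlation functions on a small example. The sketch below is an illustration only; it assumes the kernel $K_u^{(m)}(x,y) = \sqrt{u(x)u(y)} \sum_{i<m} P_i(x)P_i(y)/p_i$ and avoids square roots by using $\det[\sqrt{w_i w_j}\, S_{ij}]_{i,j \in A} = \prod_{i \in A} w_i \cdot \det[S_{ij}]_{i,j \in A}$. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations
from math import prod

def det(a):
    """Determinant by Laplace expansion along the first row (tiny matrices only)."""
    if not a:
        return Fraction(1)
    return sum((-1) ** j * a[0][j] * det([row[:j] + row[j + 1:] for row in a[1:]])
               for j in range(len(a)))

def ensemble(xs, w, m):
    """P_w^{(m)} as a dict: sorted m-point index tuple -> probability."""
    raw = {}
    for idx in combinations(range(len(xs)), m):
        vdm2 = prod((xs[a] - xs[b]) ** 2 for a, b in combinations(idx, 2))
        raw[idx] = Fraction(vdm2) * prod(w[i] for i in idx)
    total = sum(raw.values())
    return {idx: val / total for idx, val in raw.items()}

def rho_bruteforce(xs, w, m, A):
    """rho_n(A | P_w^{(m)}): total probability of the sets B containing A."""
    return sum(pr for B, pr in ensemble(xs, w, m).items() if set(A) <= set(B))

def rho_determinant(xs, w, m, A):
    """det[K_w^{(m)}] over the index set A, computed without square roots."""
    polys, norms = [], []
    for r in range(m):                      # Gram-Schmidt up to degree m - 1
        vals = [Fraction(x) ** r for x in xs]
        for q, h in zip(polys, norms):
            c = sum(a * b * wk for a, b, wk in zip(vals, q, w)) / h
            vals = [a - c * b for a, b in zip(vals, q)]
        polys.append(vals)
        norms.append(sum(a * a * wk for a, wk in zip(vals, w)))
    S = [[sum(P[i] * P[j] / h for P, h in zip(polys, norms)) for j in A] for i in A]
    return prod(w[i] for i in A) * det(S)

xs = [0, 1, 3, 4, 6]
w = [1, 2, Fraction(1, 2), 3, 1]
print(rho_bruteforce(xs, w, 3, (0, 2)) == rho_determinant(xs, w, 3, (0, 2)))
```

For index sets A with more than m points both functions return zero, in agreement with the rank argument for the kernel matrix.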

Note that if n > m then the nth correlation function of $\mathcal{P}_u^{(m)}$ vanishes identically. 
Indeed, all sets with more than m points have measure zero with respect 
to $\mathcal{P}_u^{(m)}$. Another way to see the vanishing is to observe that the matrix 
$\|K_u^{(m)}(x_i, x_j)\|_{i,j = 0, \dots, M}$ has rank m. Thus, its n × n minors expressing $\rho_n(\,\cdot \mid \mathcal{P}_u^{(m)})$ 
must vanish if n > m. 

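Similarly, for any $m = 1, \dots, M$, the correlation functions of $\mathcal{P}_v^{(m)}$ have the form 

\[ \rho_n(\{x_{i_1}, \dots, x_{i_n}\} \mid \mathcal{P}_v^{(m)}) = \det \bigl[ K_v^{(m)}(x_{i_k}, x_{i_l}) \bigr]_{k,l = 1, \dots, n}, \]

where 

\[ K_v^{(m)}(x, y) = \sqrt{v(x) v(y)} \sum_{i=0}^{m-1} \frac{Q_i(x) Q_i(y)}{q_i}. \]

The determinantal formulas for the correlation functions above imply that $\mathcal{P}_u^{(m)}$ 
and $\mathcal{P}_v^{(m)}$ belong to the class of determinantal point processes, see [Ma], [DVJ, 5.4], 
[BOO, Appendix], [So] for a general discussion of such processes. 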

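Proposition 4. For any $m = 1, \dots, M$, the correlation functions of $\overline{\mathcal{P}}_u^{(m)}$ have the 
form 

\[ \rho_n(\{x_{i_1}, \dots, x_{i_n}\} \mid \overline{\mathcal{P}}_u^{(m)}) = \det \bigl[ \overline{K}_u^{(m)}(x_{i_k}, x_{i_l}) \bigr]_{k,l = 1, \dots, n}, \]

where 

\[ \overline{K}_u^{(m)}(x, y) = \delta_{xy} - K_u^{(m)}(x, y). \]

Here $\delta_{xy}$ is the Kronecker delta. 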

Proof. By the definition of $\overline{\mathcal{P}}_u^{(m)}$, we have 

\[ \rho_n(A \mid \overline{\mathcal{P}}_u^{(m)}) = \sum_{B \supset A} \mathcal{P}_u^{(m)}(X \setminus B) = \sum_{C \subset X,\ C \cap A = \emptyset} \mathcal{P}_u^{(m)}(C). \]

The inclusion-exclusion principle, see, e.g., [Ha, 2.1], gives 

\[ \sum_{C \subset X,\ C \cap A = \emptyset} \mathcal{P}_u^{(m)}(C) = \sum_{D \subset A} (-1)^{|D|} \rho_{|D|}(D \mid \mathcal{P}_u^{(m)}). \]

By Proposition 3, the expression on the right-hand side is equal to the alternating 
sum of all diagonal minors of the matrix $\|K_u^{(m)}(x, y)\|_{x, y \in A}$. By linear algebra, this 
is equal to $\det[\delta_{xy} - K_u^{(m)}(x, y)]_{x, y \in A}$. □ 
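
The linear-algebra step used at the end of this proof is the general identity $\det(I - K) = \sum_{D} (-1)^{|D|} \det K_D$, where D runs over all index subsets (the empty set contributes 1) and $K_D$ is the corresponding diagonal submatrix. A minimal Python sketch with exact rationals illustrates it; the function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def det(a):
    """Determinant by Laplace expansion along the first row (tiny matrices only)."""
    if not a:
        return Fraction(1)
    return sum((-1) ** j * a[0][j] * det([row[:j] + row[j + 1:] for row in a[1:]])
               for j in range(len(a)))

def alternating_minor_sum(k):
    """Sum over all index subsets D of (-1)^{|D|} det K_D."""
    n = len(k)
    total = Fraction(0)
    for size in range(n + 1):
        for d in combinations(range(n), size):
            total += (-1) ** size * det([[k[i][j] for j in d] for i in d])
    return total

def det_identity_minus(k):
    """det(I - K)."""
    n = len(k)
    return det([[(1 if i == j else 0) - k[i][j] for j in range(n)]
                for i in range(n)])

K = [[Fraction(1, 2), Fraction(1, 3), Fraction(0)],
     [Fraction(1, 3), Fraction(1, 4), Fraction(1, 5)],
     [Fraction(0), Fraction(1, 5), Fraction(2, 3)]]
print(alternating_minor_sum(K) == det_identity_minus(K))
```

The identity holds for arbitrary square matrices; symmetry of K plays no role in this step.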

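Similarly, for any $m = 1, \dots, M$, the correlation functions of $\overline{\mathcal{P}}_v^{(m)}$ have the form 

\[ \rho_n(\{x_{i_1}, \dots, x_{i_n}\} \mid \overline{\mathcal{P}}_v^{(m)}) = \det \bigl[ \overline{K}_v^{(m)}(x_{i_k}, x_{i_l}) \bigr]_{k,l = 1, \dots, n}, \]

where 

\[ \overline{K}_v^{(m)}(x, y) = \delta_{xy} - K_v^{(m)}(x, y). \]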

Proposition 4 is a special case of the complementation principle for discrete 
determinantal processes, which is due to S. Kerov, see [BOO, A.3]. 

Observe that Proposition 2 and Propositions 3 and 4, with similar statements 
regarding $\mathcal{P}_v^{(m)}$ and $\overline{\mathcal{P}}_v^{(m)}$, imply that all the diagonal minors of the matrix 

\[ K_u^{(m)} = \|K_u^{(m)}(x_i, x_j)\|_{i,j = 0, \dots, M} \]

are equal to the corresponding diagonal minors of the matrix 

\[ I - K_v^{(M-m+1)} = \|\delta_{ij} - K_v^{(M-m+1)}(x_i, x_j)\|_{i,j = 0, \dots, M}. \]

In particular, the diagonal entries of these two matrices are equal. Looking at 2 × 2 
diagonal minors, we then conclude that 

\[ K_u^{(m)}(x, y) = \pm K_v^{(M-m+1)}(x, y) \]

for all $x \ne y$. (Here we used the fact that both matrices are symmetric.) 

An obvious guess is that the matrices $K_u^{(m)}$ and $I - K_v^{(M-m+1)}$ are conjugate, 
and the conjugation matrix is diagonal with diagonal entries equal to ±1. This 
guess turns out to be correct. 

Set 

\[ D = \operatorname{diag}(\epsilon(x_0), \epsilon(x_1), \dots, \epsilon(x_M)), \]

where $\epsilon(x)$ was defined in Theorem 1. 

Theorem 5. Under the above notation, for any m = 0, 1, . . . , M, 

\[ K_u^{(m)} = D\,(I - K_v^{(M-m+1)})\,D, \]

where the functions u and v satisfy (1). 
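
Before turning to the proof, the statement of Theorem 5 can be checked numerically. The following Python sketch is an illustration in floating-point arithmetic (hence the tolerance); it assumes the kernel $K_w^{(m)}(x,y) = \sqrt{w(x)w(y)} \sum_{r<m} P_r(x)P_r(y)/p_r$ and reports the largest entrywise deviation between the two sides. The function names are hypothetical.

```python
import math

def kernel(xs, w, m):
    """Matrix K_w^{(m)}(x_i, x_j), built from Gram-Schmidt orthogonal polynomials."""
    n = len(xs)
    polys, norms = [], []
    for r in range(m):
        vals = [x ** r for x in xs]
        for q, h in zip(polys, norms):
            c = sum(a * b * wk for a, b, wk in zip(vals, q, w)) / h
            vals = [a - c * b for a, b in zip(vals, q)]
        polys.append(vals)
        norms.append(sum(a * a * wk for a, wk in zip(vals, w)))
    return [[math.sqrt(w[i] * w[j]) * sum(P[i] * P[j] / h for P, h in zip(polys, norms))
             for j in range(n)] for i in range(n)]

def vandermonde_factor(xs, k):
    """prod_{j != k} (x_k - x_j)."""
    return math.prod(xs[k] - xj for j, xj in enumerate(xs) if j != k)

def theorem5_defect(xs, u, m):
    """max |K_u^{(m)} - D (I - K_v^{(M-m+1)}) D| over all matrix entries."""
    n = len(xs)                                           # n = M + 1
    v = [1.0 / (u[k] * vandermonde_factor(xs, k) ** 2) for k in range(n)]
    eps = [math.copysign(1.0, vandermonde_factor(xs, k)) for k in range(n)]
    Ku = kernel(xs, u, m)
    Kv = kernel(xs, v, n - m)                             # n - m = M - m + 1
    return max(abs(Ku[i][j] - eps[i] * eps[j] * ((i == j) - Kv[i][j]))
               for i in range(n) for j in range(n))

xs = [0.0, 1.0, 2.5, 4.0, 4.5]
u = [1.0, 0.5, 2.0, 1.5, 0.7]
print(max(theorem5_defect(xs, u, m) for m in range(1, 5)) < 1e-9)
```

The kernel does not depend on how the orthogonal polynomials are normalized, so the monic polynomials produced by Gram-Schmidt suffice here.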

Proof. The equality of the diagonal entries was discussed above: it is exactly the 
equality of the first correlation functions of the processes $\mathcal{P}_u^{(m)}$ and $\overline{\mathcal{P}}_v^{(M-m+1)}$, see 
Propositions 2, 3, 4. To prove the equality of the off-diagonal entries we employ 
the well-known Christoffel-Darboux formula, see, e.g., [Sz], which implies that, for 
$x \ne y$, 

\[ K_u^{(m)}(x, y) = \frac{a_{m-1}}{a_m\, p_{m-1}} \cdot \frac{P_m(x) P_{m-1}(y) - P_{m-1}(x) P_m(y)}{x - y}\, \sqrt{u(x) u(y)}, \]

\[ K_v^{(M-m+1)}(x, y) = \frac{b_{M-m}}{b_{M-m+1}\, q_{M-m}} \cdot \frac{Q_{M-m+1}(x) Q_{M-m}(y) - Q_{M-m}(x) Q_{M-m+1}(y)}{x - y}\, \sqrt{v(x) v(y)}. \]

Then Theorem 1 immediately implies that $K_u^{(m)}(x, y) = -\epsilon(x)\epsilon(y) K_v^{(M-m+1)}(x, y)$, 
and the proof is complete. □ 

3. Examples 

Our main reference for this section is [KS]. We use it for the notation and data 
on the classical orthogonal polynomials considered below. 

3.1. Krawtchouk polynomials. Let X = {0, 1, . . . , N}, and 

\[ u(x) = \binom{N}{x} p^x (1 - p)^{N-x} = \frac{N!}{x!\,(N-x)!}\, p^x (1 - p)^{N-x}, \qquad x \in X, \quad 0 < p < 1. \]

The polynomials orthogonal with the weight u(x) are called the Krawtchouk 
polynomials, see [KS, 1.10], 

\[ P_n(x) = K_n(x; p, N), \qquad n = 0, 1, \dots, N. \]

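The leading coefficient $a_n$ of $P_n$, the square of the norm $p_n$ of $P_n$, and the explicit 
formula for $P_n$ are as follows: 

\[ a_n = \bigl( (-N)_n\, p^n \bigr)^{-1}, \qquad p_n = \binom{N}{n}^{-1} \Bigl( \frac{1-p}{p} \Bigr)^{n}, \qquad P_n(x) = {}_2F_1\!\left( \left. \begin{matrix} -n,\ -x \\ -N \end{matrix} \,\right|\, \frac{1}{p} \right). \]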



Observe that 

\[ \prod_{y = 0, \dots, N,\ y \ne x} (x - y)^2 = x!^2\, (N - x)!^2, \qquad x = 0, 1, \dots, N. \]

Thus, the dual (according to Theorem 1) weight v(x) has the form 

\[ v(x) = \bigl( u(x)\, x!^2 (N - x)!^2 \bigr)^{-1} = \bigl( N!^2 (p(1-p))^N \bigr)^{-1} \binom{N}{x} (1 - p)^x p^{N-x}. \]

We conclude that $Q_n(x) = \mathrm{const} \cdot K_n(x; 1 - p, N)$. An easy calculation shows that 
the normalization of Theorem 1 implies that 

\[ \mathrm{const} = (-1)^N (1 - p)^N N!, \qquad Q_n(x) = (-1)^N (1 - p)^N N!\, K_n(x; 1 - p, N). \]

Clearly, $\epsilon(x) = (-1)^{N-x}$, and the claim of Theorem 1 takes the form 

\[ K_n(x; p, N) = (-1)^x \Bigl( \frac{1-p}{p} \Bigr)^{x} K_{N-n}(x; 1 - p, N), \qquad x = 0, \dots, N. \tag{2} \]
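
Identity (2), read as $K_n(x;p,N) = (-1)^x ((1-p)/p)^x K_{N-n}(x;1-p,N)$, can be verified in exact arithmetic for small N. The sketch below assumes the standard hypergeometric definition $K_n(x;p,N) = {}_2F_1(-n,-x;-N;1/p)$ from [KS, 1.10]; the function names are hypothetical.

```python
from fractions import Fraction
from math import factorial, prod

def pochhammer(a, k):
    """Rising factorial (a)_k = a (a + 1) ... (a + k - 1); (a)_0 = 1."""
    return prod((a + j for j in range(k)), start=Fraction(1))

def krawtchouk(n, x, p, N):
    """K_n(x; p, N) as a terminating 2F1 sum (assumed normalization of [KS, 1.10])."""
    return sum(pochhammer(-n, k) * pochhammer(-x, k)
               / (pochhammer(-N, k) * factorial(k) * p ** k)
               for k in range(n + 1))

def check_identity_2(p, N):
    """Check (2) for all n, x in 0..N."""
    factor = (p - 1) / p                    # equals (-1) * (1 - p) / p
    return all(krawtchouk(n, x, p, N) == factor ** x * krawtchouk(N - n, x, 1 - p, N)
               for n in range(N + 1) for x in range(N + 1))

print(check_identity_2(Fraction(1, 3), 5))
```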

Of course, this identity can be proved directly using the explicit formula for the 
Krawtchouk polynomials above. One just needs to use the transformation formula 

3.2. Hahn polynomials. The computation below was shown to me by T. Koornwinder. 
Let X be as above, and 

\[ u(x) = \binom{\alpha + x}{x} \binom{\beta + N - x}{N - x}, \qquad \alpha, \beta > -1 \ \text{ or } \ \alpha, \beta < -N. \]

If $\alpha, \beta > -1$ then $u(x) > 0$, if $\alpha, \beta < -N$ then $(-1)^N u(x) > 0$. 

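The orthogonal polynomials corresponding to this weight are called the Hahn 
polynomials, see [KS, 1.5], 

\[ P_n(x) = H_n(x; \alpha, \beta, N), \qquad n = 0, 1, \dots, N. \]

The data are as follows: 

\[ a_n = \frac{(n + \alpha + \beta + 1)_n}{(\alpha + 1)_n (-N)_n}, \qquad p_n = \frac{(-1)^n (n + \alpha + \beta + 1)_{N+1} (\beta + 1)_n\, n!}{(2n + \alpha + \beta + 1)(\alpha + 1)_n (-N)_n\, N!}, \]

\[ P_n(x) = {}_3F_2\!\left( \left. \begin{matrix} -n,\ n + \alpha + \beta + 1,\ -x \\ \alpha + 1,\ -N \end{matrix} \,\right|\, 1 \right). \]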
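The dual weight has the form 

\[ v(x) = \bigl( u(x)\, x!^2 (N - x)!^2 \bigr)^{-1} = \frac{(-1)^N}{(\alpha + 1)_N (\beta + 1)_N} \binom{(-\beta - N - 1) + x}{x} \binom{(-\alpha - N - 1) + N - x}{N - x}. \]

Thus, $Q_n(x) = \mathrm{const} \cdot H_n(x; -\beta - N - 1, -\alpha - N - 1, N)$. Computation of the 
normalization constant yields 

\[ \mathrm{const} = (-1)^N (\beta + 1)_N, \qquad Q_n(x) = (-1)^N (\beta + 1)_N\, H_n(x; -\beta - N - 1, -\alpha - N - 1, N). \]

The claim of Theorem 1 takes the form 

\[ H_n(x; \alpha, \beta, N) = \frac{(-\beta - N)_x}{(\alpha + 1)_x}\, H_{N-n}(x; -\beta - N - 1, -\alpha - N - 1, N) \tag{3} \]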

for all x = 0, 1, . . . , N. 
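
Identity (3), read as $H_n(x;\alpha,\beta,N) = \frac{(-\beta-N)_x}{(\alpha+1)_x} H_{N-n}(x;-\beta-N-1,-\alpha-N-1,N)$, can likewise be checked with exact rationals. The sketch assumes the standard definition $H_n(x;\alpha,\beta,N) = {}_3F_2(-n, n+\alpha+\beta+1, -x; \alpha+1, -N; 1)$ from [KS, 1.5] and parameters $\alpha, \beta > -1$; the function names are hypothetical.

```python
from fractions import Fraction
from math import factorial, prod

def pochhammer(a, k):
    """Rising factorial (a)_k = a (a + 1) ... (a + k - 1); (a)_0 = 1."""
    return prod((a + j for j in range(k)), start=Fraction(1))

def hahn(n, x, a, b, N):
    """H_n(x; a, b, N) as a terminating 3F2 sum at argument 1 (assumed [KS, 1.5] form)."""
    return sum(pochhammer(-n, k) * pochhammer(n + a + b + 1, k) * pochhammer(-x, k)
               / (pochhammer(a + 1, k) * pochhammer(-N, k) * factorial(k))
               for k in range(n + 1))

def check_identity_3(a, b, N):
    """Check (3) for all n, x in 0..N (assumes a, b > -1)."""
    for n in range(N + 1):
        for x in range(N + 1):
            factor = pochhammer(-b - N, x) / pochhammer(a + 1, x)
            rhs = factor * hahn(N - n, x, -b - N - 1, -a - N - 1, N)
            if hahn(n, x, a, b, N) != rhs:
                return False
    return True

print(check_identity_3(Fraction(1, 2), Fraction(3, 2), 4))
```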

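A direct proof of (3) follows from the transformation formula 

\[ {}_3F_2\!\left( \left. \begin{matrix} a,\ b,\ c \\ d,\ e \end{matrix} \,\right|\, 1 \right) = \frac{\Gamma(d)\, \Gamma(d + e - a - b - c)}{\Gamma(d + e - a - b)\, \Gamma(d - c)}\ {}_3F_2\!\left( \left. \begin{matrix} e - a,\ e - b,\ c \\ d + e - a - b,\ e \end{matrix} \,\right|\, 1 \right), \]

see [PBM, 7.4.4(1)], [Ba, 3.6]. 

The limit transition $\alpha = pt$, $\beta = (1 - p)t$, $t \to \infty$, see [KS, 2.5.3], brings (3) to 
(2). 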